Ontology-driven discourse analysis for information extraction

نویسندگان

  • Philipp Cimiano
  • Uwe Reyle
  • Jasmin Saric
چکیده

This paper presents a novel approach to discourse analysis within information extraction systems. It makes use of DRT as formal representation of the linguistic context as well as of a domain-specific ontology as a basis to compute conceptual relations between extracted events thus establishing discourse coherence. The approach has been implemented within GenIE, an information extraction system with the aim of extracting information about biochemical pathways, about sequences, structures and functions of genomes and proteins. The approach is evaluated against a semantically hand-annotated set of Swiss-Prot protein function descriptions and shows very promising results. 2004 Elsevier B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ontology-Driven Discourse Analysis in GenIE

This paper presents a novel approach to discourse analysis within information extraction systems. It makes use of DRT as formal representation of the linguistic context as well as of a domain-specific ontology as a basis to compute conceptual relations between extracted events thus establishing discourse coherence. The approach has been implemented within GenIE, an information extraction system...

متن کامل

Presenting a method for extracting structured domain-dependent information from Farsi Web pages

Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...

متن کامل

A protocol for constructing a domain-specific ontology for use in biomedical information extraction using lexical-chaining analysis

In order to do more semantics-based information extraction, we require specialized domain models. We develop a hybrid approach for constructing such a domain-specific ontology, which integrates key concepts from the protein-protein– interaction domain with the Gene Ontology. In addition, we present a method for using the domain-specific ontology in a discourse-based analysis module for analyzin...

متن کامل

Automatic Annotation of Discourse and Semantic Relations Supplemented by Terminology Extraction for Domain Ontology Building and Information Retrieval

In this article, we develop a framework for the building of domain ontologies and a semantic index based on two technologies: terminology extraction with LEXTER (© EDF R&D) and discourse and semantic annotation with EXCOM. We have selected two specific points of view for this study: causality and part-whole notions. In the first part of this paper, we explain the contributions of a terminology ...

متن کامل

Ontology-Driven Information Systems: Challenges and Requirements

The increased use of ontologies in several application fields makes it possible to observe requirements for their smooth integration within Information Systems. In this paper we analyse these requirements and propose the usage of additional semantic knowledge in the ontology to reconcile them. We think that these properties are essential to enhance the performance of ontology-driven Information...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Data Knowl. Eng.

دوره 55  شماره 

صفحات  -

تاریخ انتشار 2005